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Abstract — This work presents, to the best of our knowledge of 
the literature, the first analytic model to address the performance 
of an LRU (Least Recently Used) implementing cache under non- 
stationary traffic conditions, i.e., when the popularity of content 
evolves with time. We validate the accuracy of the model using 
Monte Carlo simulations. We show that the model is capable 
of accurately estimating the cache hit probability, when the 
popularity of content is non-stationary. 

We find that there exists a dependency between the perfor- 
mance of an LRU implementing cache and i) the lifetime of 
content in a system, ii) the volume of requests associated with it, 
iii) the distribution of content request volumes and iv) the shape 
of the popularity profile over time. 

I. Introduction 

Content caching is today a primitive network management 
operation, while the operation and performance of Content 
Delivery Networks (CDNs) is predicated on understanding on 
how users consume content. 

This argument is given urgency by two main factors. First, 
the continued growth of Internet traffic, especially in the 
mobile context, increases the demand on limited network 
resources JT|, J2). Second, the dominance of multimedia traffic 
is today de-facto Q. Recent studies 12 show that in the US, 
services offering real-time video and audio streaming occupy 
62.5% and 54.7% of peak-period downstream traffic for fixed 
and mobile networks respectively. Similarly in Europe, real- 
time multimedia traffic accounts for 33.5-50% of peak-period 
downstream traffic in fixed networks, while globally, video 
traffic alone is project to account for 55% of all "consumer 
Internet traffic" by 2016 Q. 

This traffic profile poses unique challenges to providing 
users with a reliable Quality of Service (QoS). For instance, 
Liu et. al. Q report that 20% of users experience re-buffering 
when streaming contents, while 14% of users suffer significant 
delays before videos start to play. Furthermore, new network- 
ing paradigms such as ICN (Information-Centric Networking) 
are built on the implicit assumption of ubiquitous content 
caching J4|, 0, such that small caches are co-located with 
routers in order to offset traffic latency. 

Therefore, improving the effectiveness of content caching 
is paramount in aiding to address the problem of scaling the 
network while providing the necessary QoS to users. However, 
the vast majority of the studies on caching assume traffic 
patterns to be time-invariant, i.e., users browse through a 
large static catalogue of contents and make requests according 



to their different static popularity distributions - typically 
assumed to be Zipf. 

In reality however, the popularity of contents varies over 
time and different contents exhibit a wide range of popularity 
evolution patterns J6], Q. Contents tend to differentiate in i) 
when they start to attract user attention, ii) how much attention 
they attract and iii) how long they sustain the attraction. For 
instance, contents related sporting or geo-political events such 
as Olympic videos tend to enjoy a very short lifetime J2), 
reflecting users' immediate interest in the topic. In contrast, 
some YouTube music videos keep on attracting user attention 
many months after being released Q. Clearly, this observed 
behaviour is incompatible with time-invariant popularity mod- 
els, and raises the need for more accurate tools that take into 
consideration the evolution of the popularity of contents over 
time. 

This work presents the first steps in this direction. We 
extend the results of Che et al. J8) to take the time-variant 
popularity of contents explicitly into account, and present an 
approximated model of an LRU (Least Recently Used) caching 
under non-stationary traffic conditions. The accuracy of our 
model is validated against Monte Carlo simulations and shows 
that the non-stationarity of content popularity has a dramatic 
effect on caching performance. We find that even when cache 
sizes are small as in the context of ICNs, i.e., when cache 
dynamics change much faster than the popularity evolution, 
the performance of the cache is largely sensitive to popularity 
dynamics. 

II. LRU UNDER NON- STATIONARY CONDITIONS 

In this section, we first present the assumptions that unpin 
our model (Sec. Ill- At . This is followed by the derivation of the 
cache hit probability under the non-stationary traffic scenario 
(Sec. III-BI) . with specific reference to scenarios with large 
(Sec. III-Q and small cache sizes (Sec. III-DI) . 

A. A simple non- stationary traffic model 

We start by assuming that contents are introduced into 
a catalogue (i.e. uploaded on some server) at random. For 
simplicity, this is taken to be according to a homogeneous 
Poisson process with rate 7. Furthermore, it is assumed that 
individual content popularity evolves (over time) according to 
some predetermined profile. Initially, it is assumed that all 
contents follow the same popularity profile, in Sec. [IV] we 
show that this assumption can be relaxed. 



Let us now consider a generic content m, introduced into 
the catalogue at time r m , and whose popularity evolves over 
time according to: 

\n(t) = V m X(t - T m ) 

where A(i) represents the popularity profile and V m a random 
mark (i.e. a random quantity) associated to content m. Popu- 
larity {X m (t)) in this context represents the instantaneous rate 
at which requests for a given content m arrive at the cache. 

Requests are assumed to form an independent time- 
inhomogeneous Poisson processes and X(t) is taken to be 
an arbitrary function satisfying the following conditions: i) 
(positiveness) X(t) > Vi with A(0+) > 0, ii) (causality) 
X(t) = Vi < 0, iii) (smoothness) X(t) continuous almost 
everywhere, iv) (integrability) / °° X(t)dt = 1. The average 
content lifetime can be computed as L = J °° tX(t)dt. 

Observe that V m represents the expected total number of 
requests (volume) induced by content m during its whole life 
in the system. More specifically, by construction, the total 
number of requests for content m is given by a Poisson 
distribution with an average of V m . We assume that volumes 
of requests for different contents form an i.i.d. sequence of 
random variables distributed around some reference V. We 
denote 4>v{x) = Efe 31 ^] to be the moment generating function 
of V and 4>' v (x) its first derivative. 

Finally, the aggregate process of requests arriving to the 
cache is now by construction a Cox process J5] whose 
stochastic intensity is given by: A(t) = J2 m VmX{t — r m ). 

B. Cache hit probability 

In this section we extend the LRU approximation of Che et. 
al. |H) to our non-stationary traffic model in order to estimate 
the cache hit probability, i.e. the probability that a generic 
request finds the content in the cache. 

Consider a cache capable of storing C distinct contents. Let 
Tc{m) be the time needed for C distinct contents not includ- 
ing m to be requested by users. Tc(m) therefore represents 
the cache eviction time for content m, i.e. after which point 
content m will be evicted from the cache. Che's approximation 
is centred on assuming that the cache eviction time (Tc(m)) 
is deterministic and independent from the selected content 
(m). This assumption has been given a theoretical justification 
in iflOl . where it is shown that, under a Zipf-like static pop- 
ularity distribution, the coefficient of variation of the random 
variable representing Tc(m) tends to vanish as the cache size 
grows. Furthermore, the dependence of the eviction time on m 
becomes negligible when the content catalogue is sufficiently 
large. The arguments given in ifTO) are easily extended to our 
non-stationary traffic model when 7 and C are sufficiently 
large. 

Returning to our non-stationary traffic model, we can now 
state our main result: 

Theorem 1: Consider a cache of size C implementing 
LRU policy, operating under a non-stationary popularity model 
(as introduced in Sec. III-Ab with total stochastic intensity: 



A(t) = V m X(t — r m ). Extending Che's approximation, 
the hit probability is given by: 

r t'v(-Jo C Kr-d)de) 
Ph„ = 1 - / A(r) i — Ut (1) 
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where Tq is the solution to the equation: 

C = 7 1 - <f>v (- ° A(r - e)<wj dr (2) 

and 7 is the rate at which new contents are introduced into 
the catalogue. 

Proof: Proceeding along the same lines as for the sta- 
tionary case (recalled in Appendix [A}, we consider a constant 
cache eviction time Tq- We can now start evaluating the 
probability of finding a given content in the cache at time 
t, conditional on the time it has been introduced into the 
catalogue (r) and its request volume (V). This corresponds 
to the event in which one or more requests for the content 
occur in the time interval [t — Tc,i\. This probability can be 
evaluated as Q as: 

M*|r,V) = l-e- v W<'-*>" (3) 

Now, unconditioning with respect to V in (0, we obtain: 



Pm{t I r) = E v 



1 - e 



- V ft-T c A(0-r)c 



1 - <t> v ( - / X(6- r)d6 

It-Ta 



To evaluate the probability of finding the given content in the 
cache at time t, we uncondition with respect to r and obtain: 
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This result exploits the elementary property of Poisson pro- 
cesses that, when a point falls within a specified interval of 
time, its distribution is uniform over the considered interval. 

Now, as in the case of the stationary popularity scenario, 
for a sufficiently large t, the cache is completely filled with 
contents introduced to the catalogue before t and the number 
of contents in the cache is exactly equal to the size of the 
cache. We can therefore claim: 

C ^ ^ [^{content m in cache|T m <t}^T m <t] 
m 

where r m is the time at which content m is introduced into 
the catalogue, and the sum extends over all the contents in the 
infinite content catalogue. Averaging both terms we obtain: 
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'similarly to | |191 obtained under stationary popularity 



Since the average number of contents introduced to the cat- 
alogue at any time interval of size t is 7, by combining 
with 10, we can evaluate the size of the cache C as: 
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Equation <j6j proves @, and must be solved (numerically) to 
evaluate the eviction time (Tc) given a cache size of C. 

Having defined Tc, we now return to evaluating the hit 
probability for a given content with the parameters (tq, Vq). By 
definition, the request for a given content at time t, generates 
a hit at the cache iff the content is located on the cache. 
Therefore, the probability of a hit is given by: 

Phit(t I T , Vq) = Pin(t I T , Vq) 

Now, to uncondition phit(t | To,Vb) with respect to Vq 
and To, we have to consider that the probability with which 
contents are requested by users is biased toward contents 
with higher instantaneous popularity. Let N(Vq, AV, tq, Ato) 
be the average number of contents that have been generated 
during the interval [to,to + A To) with request volume in 
[Vb,Vb + AVb). Now the probability that a request arrives 
for one of the contents defined above is 

N{V ,AV,to,At ) V X(t - t ) 
X E[V] 

where the second term represents the instantaneous rate orig- 
inated by every considered content. Thus, recalling (0 we 
have: 



Phit(t) = E Tj v 
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By substituting a 



Pbit(t) = / A (a) 



t and {3 = t 
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Thanks to the integrability property of A(t), ([U) is obtained by 
letting t — >• 00 in ©. ■ 

The following corollary sheds some light on the relation 
between C and Tc'- 

Corollary 1: The variables C and Tc satisfy: 

poo i-Tc 

C < jE[V] / / A(t - 6)d9dT = -fE[V}T c (8) 
Jo Jo 



and 



C > -yE[V]T c - 7 



E[V 



21 /-oo / fT c \ 

J I J X{T-6)d6\ dr (9) 



Proof: The inequality in ([8]) is derived for (ff) by exploit- 
ing the inequality 1 + x < e x . In particular we exploit the 
previous inequality to lower bound </>y(— J^ X(t — 6)d6) = 

E[e" f«° Mr~e)d8)} wkh 1 _ E [y] jTc ^ _ q^q ^ 

integral appearing in ©. 

Similarly, the inequality in (0 is obtained by exploiting 
for every x > 0. 
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Note that the upper bound in ([8]l has a simple meaning; as- 
suming that all the requests are referring to different contents, 
the cache size is bounded by the overall number of requests 
7-E[V] during the time interval Tc- In Sec. IH-Dl we will show 
that this bound is a good approximation for C when cache size 
is small. 

To gather more insights on the impact of different param- 
eters on Phi t , we now derive a simplified expression for the 
two extreme scenario regimes of large cache and small cache 
sizes. 

C. Large-cache regime 

A closed form expression for the asymptotic hit probability 
(phit.oo) when the cache C — > 00 can be derived from (Q3 by 
making Tc — > 00. 

Corollary 2: For large cache sizes, 
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Phit,oo = 1 ^7—; = 1 - 



E[V] 



E\e- V ] 



E[V] E[V] 



(10) 



Proof: Consider the limit as Tc — > 00 in the integral 
within (Q}; it holds that: 



X(t)<P' v (- A(t - 9)d9^ dr = 

J \(T)<f> f v (-J X(a)dc?jdT (11) 



Now if we define A(t) = JJ" X(a)da (by construction, A(a) 
is also the primitive of A(a)) and f3 — A(t), by substituting 
(3 into (fTTT i. we obtain: 



X(r)cP' v (-A(T))dT = / M-PW 



^v(O) - = 1 - M-l) (12) 

Finally, ( TTcTb is obtained by using (TT2l i within (HJ. ■ 
Observe that (TlOb depends heavily on the distribution of the 
content request volumes (V), and is completely independent of 
the temporal profile of the popularity (A(t)). This is expected, 
when we consider that as C and Tc grow large, contents are 
never evicted from the cache. In effect, only the first request 
for every content will lead to a cache miss, independently of 
the arrival request pattern. 

The expression (TlOb is exact, since for C — > 00 it can be eas- 
ily proved that Tc (m) — > 00 w.p. 1 . This is obtained exploiting 



by the following properties: i) as C — > oo the conditional hit 
probability for contents originating R > 1 requests tends to 
Phit(R) = 1 — 1/R, and ii) the probability of observing at least 
one request for content m is Pr(i? > 1) = 1 — e Vm . 

The value of C (and consequently Tc) for which y>hit ap- 
proaches Phit,oo> instead heavily depends the popularity profile 
A(i). Indeed it is possible to derive a bound on the difference 
of the hit probability from Phit.cxv 

Corollary 3: 

Phit.oo - Phit,T c < / A(t)<2t 
Proof: Starting from (HJ, we obtain: 
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where we have operated the change of variable a = r — 9. 
By observing that <j)'(x) = E[Ve xV ] < E[V] for any x < 0, 
it is possible to upper bound the right-most term of (TTjt as 
follows: 



A(r) 
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Thanks to ( fT2T > it is also possible to upper bound the left-most 
term: 



To 
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/o E[V] E[F] 

which corresponds to Phit.oo- By combining ( fT4l i with (15[ , we 
get the assert. ■ 

D. Small-cache regime 

Under this regime, we get the following hit probability: 
Corollary 4: For very small cache sizes, we can approxi- 
mate the hit probability as: 

E[V 2 



Put 
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Proof: The expression in (fTST i is obtained from (Q3 by 
assuming J„ Tc A(r — Q)dQ w A(r)Tc <C 1, and approximating 
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Examples of popularity profiles \(t). For all profiles the 

AVERAGE CONTENT LIFETIME IS SET EQUAL TO L. OBSERVE THAT 
/ °° A 2 (r)dr IS THE KEY PARAMETER APPEARING InQD- 
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[V] + xM[V 2 ] for small values of x. 



Furthermore, we can improve the results in Corollary Q~]to 
better approximate the relation between C and Tc as follows: 

C PS 7 E[y]T c (17) 

Following the same reasoning as the proof for (|8), ob- 
serve that, for small values of Tc, Jq° A(t — 6)d6 <c 1. 
Now <f> v (- J Q ° A(r - 9)d6) can be approximated with 1 — 
E[V] Jq C A(t — 6')rf6' and the desired relation is obtained. 

Using (T% , we can now rewrite (fTol as follows: 

E[U 2 ] C 



A 2 (r)dr 



(18) 
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The expression given in dT8l enlightens us to the potentially 
large effect the popularity profile of content has on the 
effectiveness of caching, when cache sizes are small. For 
illustration, if we consider the popularity profiles given in 
Table |U the hit probability is always inversely proportional 
to the average content lifetime (L). But, by comparing the 
third column, it is clear that the actual value depends strongly 
on the shape of the profile. 

III. Numerical validation 

In this section we present: i) the results of applying 
the approximation of LRU under non-stationary traffic (see 
Sec UI-Al i. as given by Theorem [T] ii) the validation of the 
predictions of the model through Monte Carlo simulations of 
a single cache. 

The results presented in this section relate the size of a 
cache (C) to the hit probability (phit) and look at relation 
between these two variables, when varying: i) the average 
content lifetime (L), ii) the average content request volume 
(E|V]), iii) the distribution of content request volumes, iv) the 
shape of the popularity time profile. 

For all the results that follow, we set the content arrival rate 
(7) to 10k contents per day and the content request volume (V) 
is assumed to be distributed according to a Pareto distribution: 
fv{v) = (3V^ in /v 1+ P for v > Vmm- The choice of a Pareto 
distribution is justified by two factors. First, several recent 
measurement studies have shown that the Zipf law is a very 
good approximation of the empirical distribution of long term 
content (video) request volumes ifTOl . ifTTl . Second, a Zipf- 
like distribution of content request volumes with parameter 
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Fig. 1 . Cache hit probability under exponential popularity profile, for different 
values of the average content lifetime L (expressed in days). 
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Fig. 3. Cache hit probability under exponential popularity profile with L 
days for different values of the average content volume E[V] . 
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Fig. 2. Cache hit probability under power law popularity profile with f = 3, 
for different values of the average content lifetime L (expressed in days). 



Fig. 4. Cache hit probability under exponential popularity profile for different 
request volume distributions and average content request volume E[V] = 1.5. 



a = l/(/3 — 1) is obtained when a large number of individual 
content request volumes are independently generated accord- 
ing to Pareto distribution. 

With regard to the different popularity profiles that content 
may display, we consider exponential and power law profiles 
as given in Table U Finally, the parameter ( is used to model 
the different time-dependent popularity profiles (popularity 
shapes). 

Figs. [T]and|2]report the hit probability for different values of 
content lifetime, with respect to using; i) an exponential (see 
Fig. [TJ and ii) power law with parameter £ = 3 (see Fig. |2]i 
popularity profile. In both cases, we set V m i n = 1 and f3 = 3, 
as a consequence, we obtain E[V] = 1.5 requests per content. 

The first point to observe is that the model estimates agree 
strongly with the simulation results for the hit probability in 
all the cases. Second, as predicted by the model, the average 
content lifetime (L) deeply impacts the cache performance. 
Indeed, for a given cache size (C), a given content's hit 
probability increases significantly as its lifetime is reduced. 
In particular, for moderate cache sizes, the hit probability 
is roughly inverse proportional to the content lifetime, as 
predicted by Corollary |4] 

Fig.[5]reports the cache hit probability for different values of 
the average content request volume E[V]. All plots refer to the 
same value of /3 = 3 and different values of V m i n = E[V]^j^-. 



The results here reveal that the volume of hits accumulated by 
content has impact exclusively for large cache sizes, i.e. when 
C > 200k objects, and as predicted by J10I . the cache hit 
probability increases with the volume of content requests. For 
moderate cache sizes (C < 10k), the effect of the hits becomes 
negligible as predicted by (TT~8T >. Indeed, all curves correspond 

E[V 2 ] 



to the same value of 



Fig. 2] reports the cache hit probability for different values 
of the parameter (3, associated to the distribution of content 
request volumes. From the figure we see that the shape of 
requests volumes has a significant impact on the cache hit 
probability. As expected, by decreasing /? (i,e., increasing the 
correspondent parameter a for the associated Zipf law), the 
cache performance is improved. In particular we observe that 
the caching performance become much more sensitive to /3 as 
f3 gets smaller than 2 (i.e., as the corresponding Zipf parameter 
a increases above 1). However, the impact of /? (i.e., a) on 
caching performance does not appear in our scenario to be as 
strong as it does in the classical stationary case, where a sort 
of "phase transition" is observed as a crosses the value 1 IflOl . 

Finally, Fig. [5] reports the cache hit probability for different 
content popularity profiles (i.e, varying Q - while keeping 
the average lifetime as constant at L = 10 days. From the 
figure, we see that the content popularity profile appears to 
have a moderate impact on the caching performance (for small 
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Fig. 5. Cache hit probability under power law popularity profile for different 
values of £ and the same average content lifetime L = 10 days. 

caches) as long as the average content lifetime L is kept 
constant. For the extreme case where the size cache C = 100, 
Phit varies from 0.0032 and 0.001 as ( is decreased from 4 to 
2.2 (see the simulation curves). 

IV. Extension to a Multi-class scenario 

In this section, we consider a more realistic scenario in 
which contents can be partitioned into K classes, such that 
each class is associated with a different popularity profile (t) 
and a different request volume distribution Vfc, for 1 < k < K. 

This generalisation of our traffic model is needed in order to 
capture the variability of the popularity profiles exhibited by 
real contents. Indeed the popularity profile depends heavily on 
the nature of contents, for example, the popularity evolution of 
music videos is typically significantly different to videos con- 
taining sport highlights. However, recent experimental stud- 
ies Q have shown that the popularity evolution of different 
contents can be clustered to relatively few groups exhibiting 
similar temporal popularity profiles. 

We can formalise the multi-class scenario by assuming that 
every generated content (to) is associated with a random mark 
W m taking values in {1, . . . , K}, such that the mark specifies 
the class the content belongs to. Assuming {IVm} to be i.i.d. 
random variables, the total stochastic intensity at time t of the 
request process is given by: 

M*) = E V ™^W m (t ~ T m ) = ^ V m^W m (t - T r „)I {Tm < t } 
rn rn 

Under this assumption, we can now state the following: 

Theorem 2: Consider a cache of size C implementing LRU 
policy, operating under a multi-class non-stationary popularity 
model, with total stochastic intensity: A(t) = J2 m Vm^w m it— 
T m ). Extending Che's approximation, the hit probability is 
given by: 



Fig. 6. Cache hit probability under exponential popularity profile for different 
classes configurations. 
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where Tc is the solution to the equation: 
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The proof for Theorem [2] (not reported here for the sake of 
brevity) follows the same lines of Theorem Q] 

In order to evaluate the multi-class scenario, we partition 
contents into K = 4 classes, each characterised by a different 
popularity profile and lifetime. For simplicity, here, we only 
considered 5 different configurations as specified in Table |TT] 

Fig. [6] reports the cache hit probability for each setup. First, 
from the figure, we see that the model predictions of the 
cache hit probability for different cache sizes align accurately 
with the results of the simulation. Second, Fig. [6] shows 
that the heterogeneity of contents weakly impacts the cache 
performance, and that the cache hit probability increases when 
the average content lifetime (L) of each setup decreases, i.e. 
when the number of contents with fast popularity dynamics 
(belonging to class 1) is large with respect to the rest. In 
fact, as already seen in Fig. [T] contents attracting users' 
attention for very limited periods of time (L = 1 days) are 
largely responsible for improving the cache performance when 
adopting the LRU strategy. 

V. Conclusions 

This work has proposed (and validated with a Monte Carlo 
simulation) a simple but highly accurate approximated model 
for an LRU (Least Recently Used) cache under non-stationary 
traffic conditions. The proposed model is flexible and can 
be easily extended to consider complex and realistic traffic 
scenarios. It has the advantage of being computationally cheap 
when compared with the Monte Carlo simulations - especially 
when cache sizes are large. 

Our results show that caching performance is deeply im- 
pacted by the dynamics resulting from the popularity of 
content. In particular, we find that when cache sizes are small 
as in the case anticipated for ICN networks, the content hit 
probability is largely sensitive to the popularity profile of 
contents. 
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Appendix A 

LRU UNDER STATIONARY POPULARITY 

First we briefly resume Che's approximation of LRU in a 
classical traffic scenario in which every content m, in a finite 
catalog of size M, presents a time-invariant popularity profile. 
More in details, we assume that requests for content in arrive 
at the cache according to a time homogeneous Poisson process 
with intensity A m . Let A = Yl m X m ^ e tne resulting total 
arrival intensity of content requests at the cache. 

Now, thanks to Che's approximation, discussed in Sec. III-BI 
a content to is present in the cache at time t, if and only if a 
time less than T c has passed since the last request for content 
to, i.e., if at least a request for such content has arrived in 
the interval (t — T c ,t]. Since requests arrivals are Poisson, the 
probability ph(rn) that at at least one request has arrived in the 
interval (t — T C) t] is given by: Ph(rn) = 1 — e~ XmTc . Observe 
that ph (to) represents, by construction, also, the hit probability 
for content m, as immediate consequence of PASTA property 
of arrivals. 

Considering a cache of size C, by construction: C = 



y i 



{m in cache} 



When averaging both sides, we obtain: 
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Fig. 7. Hit probability phit vs. cache size for different content popularity 
distributions 



shows the impact of the Zipf's exponent a on the caching 
performance. 

As already well known, the popularity distribution shape 
has a disruptive impact on caching performance. 

For a sufficiently larger than 1, the popularity distribution 
is sufficiently skewed, so that the contribution of the few 
top popular contents to the total traffic is significant. Indeed, 
observe that that H(M) = 6(1) when M grows large, i.e. the 
aggregate contribution of tail contents is marginal. 

For a < 1, instead, only caching a significant portion of the 
huge catalog we can achieve significant cache hit probability. 
In this case H(M) diverges as M — > 00 showing that the 
aggregate contribution of tail contents is dominant. 



C = 5>[I {mincache} ] = 5>h(m) = £(1 - e- x ^). 



By solving the above relationship, we obtain T c , and then the 
average hit probability on the cache as: 



PHt = y^^PmPhjr 

j n 

A. Numerical evaluation 



(19) 



We test the Che's approximation in the stationary popularity 
case. We set a catalog size equal to M = 10 7 contents. Fig. [7] 



